Multi-Level Compositional Reasoning for Interactive Instruction Following
نویسندگان
چکیده
Robotic agents performing domestic chores by natural language directives are required to master the complex job of navigating environment and interacting with objects in environments. The tasks given often composite thus challenging as completing them require reason about multiple subtasks, e.g., bring a cup coffee. To address challenge, we propose divide conquer it breaking task into subgoals attend individually for better navigation interaction. We call Multi-level Compositional Reasoning Agent (MCR-Agent). Specifically, learn three-level action policy. At highest level, infer sequence human-interpretable be executed based on instructions high-level policy composition controller. middle discriminatively control agent’s alternating between various independent interaction policies. Finally, at lowest manipulation actions corresponding object masks using appropriate Our approach not only generates human interpretable but also achieves 2.03% absolute gain comparable state arts efficiency metric (PLWSR unseen set) without rule-based planning or semantic spatial memory. code is available https://github.com/yonseivnl/mcr-agent.
منابع مشابه
Alignment-Based Compositional Semantics for Instruction Following
This paper describes an alignment-based model for interpreting natural language instructions in context. We approach instruction following as a search over plans, scoring sequences of actions conditioned on structured observations of text and the environment. By explicitly modeling both the low-level compositional structure of individual actions and the high-level structure of full plans, we ar...
متن کاملCompositional Reasoning for Multi-modal Logics
We provide decomposition and quotienting results for multimodal logic with respect to a composition operator, traditionally used for epistemic models, due to van Eijck et al. (Journal of Applied NonClassical Logics 21(3–4):397–425, 2011), that involves sets of atomic propositions and valuation functions from Kripke models. While the composition operator was originally defined only for epistemic...
متن کاملMulti-faceted evaluation of interactive remote instruction
Over the past five years, we have used our Interactive Remote Instruction (IRI) system to teach classes to various audiences. In this paper we evaluate IRI from three perspectives: technical performance, student perceptions, and instructional view. IRI scales well to multiple delivery sites and supports classes of 30 or more students, though some tools do not scale well. We compared perceptions...
متن کاملCompositional Reasoning for Pointer Structures
Canonical trace model of Hoare and He supports a satisfactory theory of graph properties. We use it to define a technique for the general composition of properties that extends the parallel-by-merge of Unifying Theories of Programming, and apply that to unique decompositions. Applications are provided to the fundamental concepts of acyclicity, reachability and canonicity; and those are used, in...
متن کاملCompositional Reasoning for Epistemic Logics
We provide decomposition and quotienting results for multi-modal logic with respect to a composition operator, traditionally used for epistemic models, due to van Eijck et al. (Journal of Applied NonClassical Logics 21(3–4):397–425, 2011), that involves sets of atomic propositions and valuation functions from Kripke models. While the composition operator was originally defined only for epistemi...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Proceedings of the ... AAAI Conference on Artificial Intelligence
سال: 2023
ISSN: ['2159-5399', '2374-3468']
DOI: https://doi.org/10.1609/aaai.v37i1.25094